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Abstract 

We obtain new upper bounds on the additive distortion for graph emulators and 
spanners on relatively few edges. We introduce a new subroutine called “strip creation,” 
and we combine this subroutine with several other ideas to obtain the following results: 

1. Every graph has a spanner on edges with additive distortion, 

for arbitrary e G [0,1]. 

2. Every graph has an emulator on edges with additive dis¬ 

tortion whenever e € [0, |]. 

3. Every graph has a spanner on 0(n^+') edges with additive distortion 

whenever e G [0, |]. 

Our first spanner has the new best known asymptotic edge-error tradeoff for additive 
spanners whenever e G [0, |]. Our second spanner has the new best tradeoff whenever 
£ G [p ^]. Our emulator has the new best asymptotic edge-error tradeoff whenever 

£G[o,|]. 

1 Introduction 

A spanner of a graph G is a sparser subgraph of G over the same vertex set that ap¬ 
proximately preserves all pairwise distances between nodes. An emulator of a graph G is a 
possibly weighted graph H on the same vertex set as G such that the pairwise distances in 
H approximate the pairwise distances in G. Researchers try to improve the tradeoff between 
the sparsity of the spanner/emulator and the accuracy with which it preserves the distances 
of the original graph. Spanners were first introduced in the 1980s, where they were used to 
speed up protocols run over unsynchronized networks [3 [ID]. Emulators were introduced 
by Dor, Halperin and Zwick m- Spanners and emulators have since found a wide variety 
of applications, including compact routing schemes [iniiiiiiiiiiiiiii], almost-shortest path 
algorithms [161 [HI [13 US] , distance oracles [2311111123], broadcasting |18| . and many others. 

Much of the initial theoretical work on spanners studied multiplicative stretch spanners, 
that is, all pairwise distances are preserved up to a small multiplicative factor. Althofer et 
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al. [5] discovered that there exist spanners on edges with multiplicative stretch 

2k —1 for all integers fc > 1. This upper bound is tight for multiplicative spanners assuming 
an unproven conjecture of Erdos m- Recent work has focused more on mixed and additive 
spanners. An additive spanner preserves the distances within a small additive error, and 
mixed spanners allow both a multiplicative and an additive error. 

There are three known constructions that produce spanners of constant additive error. 
Aingworth et al. ^ gave a construction for spanners on edges with additive error of 

2, Chechik [9] showed how to construct spanners on edges with additive distortion 

of 4, and Baswana et al. mm gave a construction for spanners on 0{n‘^^^) edges with 
additive distortion of 6 (the runtime of this construction was improved by Woodruff [17]). 
Knudsen [TS] later simplified the constructions of +2 and +6 spanners while obtaining the 
same edge bounds. Dor, Halperin and Zwick m obtain an additive emulator on 
edges and +4 distortion. It remains a major open problem whether or not all graphs admit 
spanners or emulators of constant additive distortion with edges for any e > 0 . 

Notably, Woodruff |26j has proven the existence of a graph family for which any spanner 
of 2 fc — 1 additive distortion has at least edges, so we cannot hope for 0(n) 

edge spanners with constant additive distortion. 

In light of this, some attempts have been made to produce relatively efficient span¬ 
ners below the threshold. Bollobas et al. | 8 ] showed a construction for spanners 
on edges and additive distortion; the distortion was later improved to 

by Baswana et al jS]. Pettie jUj achieved 7 i 9 /i 6 - 7 e /8 distortion, and Chechik 
achieved distortion whenever e G [^, 5 ]- Jointly, these last two spanners form 

the current state of the art. The construction of Baswana et al jS] can be generalized in a 
straightforward way to produce emulators with additive distortion for any e S [ 0 , |] 

(they do not discuss this explicitly in their paper), but no other results for emulators are 
known. 

Our Work We introduce a new subroutine that is useful for spanner/emulator construc¬ 
tion called “strip creation,” and we apply this subroutine to obtain some significantly im¬ 
proved upper bounds on the accuracy/sparsity tradeoff available for spanners and emulators 
below the edge threshold. In Sectionj^ we use this subroutine in a straightforward way 
to produce a spanner on 0(n^+'^) edges with additive distortion. In Sectioniffi 

we merge this idea with some others to achieve an emulator construction with 
additive distortion, so long as e G [ 0 , g], and a spanner of additive distortion 
so long as e G [ 0 , j]. 

Our emulator has the best tradeoff whenever e G [0, |) (it ties the generalization of 
Baswana et al | 6 |, or Chechik’s spanner |9], at e = |). The state-of-the-art asymptotics for 
purely additive spanners are summarized in the following table 0 

^The notation 0(/(n)) suppresses poly log n factors. 

^Some authors consider spanners whose error is a function of d, the original distance between the nodes, 
rather than n; these results are not included in our table. 


2 



Author 

Distortion 

Best Result Range 

Before this paper 

Pettie [22] 

0(„9/16-7./8) 

e e [0, 

Chechik [S] 

0(„l/2-3./2) 


After this paper 

New; Section 

3 

0(„l/2-e/2^ 

e e [0, i] 

New; Section 

4 

0(„2/3-5e/3) 


Chechik [S] 

0(„l/2-3./2) 



Preliminaries All graphs in this paper are undirected and unweighted. The number of 
nodes in all of our graphs is n, unless otherwise stated. For a graph G = {V, E) and u,v G V, 
we denote by Sciu, v) the length of the shortest path in G from u to v. 

We will often refer to the shortest path between two nodes in a graph, when in fact there 
may be many equally short paths. Any shortest path may be chosen for each pair of nodes, 
as long as (1) the choice is consistent, and (2) the paths are as nested as possible - that is, 
any two shortest paths intersect on at most one subpath. We will use this second property 
in our constructions. We will use the notation pq{u,v) to refer to the chosen shortest path 
between u and v in G. 

Given a graph G, spanners and emulators are sparser versions of G that approximately 
preserve the distance between every pair of nodes. More formally: 

Definition 1. A weighted, undirected graph H = {V,E',w) is an (a,/3)-emulator of another 
graph G = (V, E) (on the same vertex set) if, for all nodes u,v G V, we have 

Sg{u,v) < 6h{u,v) < aSG{u,v) + (3. 

Definition 2. An unweighted {a, (3)-emulator of a graph G is called an (a,/3)-spanner of 
G if it is a subgraph of G. 

When 0 = 1, we say that the spanner/emulator is additive with distortion (3, and when 
/3 = 0, we say that the spanner/emulator is multiplicative with stretch a. If neither of these 
is the case, then the spanner/emulator is mixed. 

2 Strip Creation 

In this section, we will describe our main subroutine, called “strip creation.” 

Graph Clustering This is an important auxiliary subroutine. A very similar subroutine 
to this one has been used in many different spanner and emulator constructions (see [131 [9] 
for example). The input is a graph G = {V, E) and an integer e, and the output is a partial 
partitioning of V into “clusters.” The algorithm works as follows: 
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Algorithm 1: cluster (G,e) 


Data: A graph G = {V, E) and an integer e 


1 

2 

3 

4 

5 

6 

7 

8 


Initialize C = 0; 

Unmark all nodes; 

while there is an unmarked node u with at least e — 1 unmarked neighbors do 
Initialize C to be the set u plus any e — 1 of its unmarked neighbors; 

Mark all nodes in C; 

Add C to C 

end 

return C; 


Intuitively, we think each G € C as a “cluster” of nodes. Each cluster contains a node 
at distance one from all other nodes; this is called the “cluster center.” The subroutine of 
generating these clusters will be called cluster (G, e). The subroutine of picking out the 
center of a given cluster will be called center (G). A node is clustered if it belongs to some 
C G C and unclustered otherwise. 

Our version of graph clustering differs slightly from the typical clustering algorithm used 
in other spanner constructions. We produce clusters that necessarily have at least e nodes 
each (this property will be important later). The other important feature is that there are 
at most ne edges in the graph between unclustered nodes: 

Claim 1. Let C be the output o/ cluster (G, e). There are at most ne edges in G with both 
endpoints unclustered in C. 

Proof. First, note that at termination of the cluster subroutine, no node can have at least 
e — 1 unmarked neighbors: if it did, then we could turn it into a new cluster and add it 
to C. Therefore, we have at most e — 1 edges with both endpoints unclustered incident on 
any given unclustered node. We have at most n unclustered nodes in total, and the claim 
follows. □ 

Strip Creation This is our main subroutine. We will add a carefully-chosen set of short¬ 
est paths to the spanner that collectively have some convenient coverage properties. The 
subroutine works as follows: 
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Algorithm 2: createStrips(G',C, d, m) 

Data: A graph G = {V,E), a clustering C of G, and integers d, m. 

1 Initialize a set 5 = 0; 

2 while there exist u,v G V such that the following properties all hold: 

1. Sg{u, v) < d 

2. pg{u,v) intersects at most m different paths in S 

3. pg(u,v) intersects exactly m clusters that are disjoint from all paths in S 

3 do 

4 I Add pg{u,v) to 5; 

5 end 

6 return 5; 


The paths in S are the strips of the graph. This subroutine will be called createStrips(G, C, d, m). 

One property that makes this subroutine useful is that it is very cheap to add the edges 
in S to our spanner. This is our first lemma. 

Lemma 1. The set S contains only 0{n) edges that are incident on a clustered node. 

Proof. When we add a new path pg{u,v) to 5, we sort its edges (a, 6) into the following 
cases: 

1. The nodes a and b are both unclustered: this edge is not incident on a clustered node, 
so we can ignore it. 

2. There is no edge already in S that touches a (or the same condition holds for b): there 
are 0{n) of this type of edge in total. 

3. There is an edge in S that touches a and another edge in S that touches 6, but no 
edge (a, 5): this type of edge must go between two strips that are already in S. Since 
two shortest paths intersect on at most one subpath, there can be at most two of these 
edges per path in S that intersects pg{u,v). Therefore, the number of this type of 
edge in pg{u,v) is at most 2m. There are at most ^ paths added to S in total (since 
each one must be the first to touch m new clusters), so the total number of this type 
of edge in the graph is 0{n). 

4. The edge (a, b) is already in S: this does not add a new edge to 5, so we can ignore 
it. 


□ 

Adding the edges of a strip set to our graph gives it some convenient connectivity prop¬ 
erties. This is the subject of our next two lemmas. 

Definition 3. A cluster is clean if it does not share a node with any strip S € S. 

Althofer et al. [2] describe an efficient algorithm that generates spanners on 
edges with multiplicative stretch 2A: — 1. We will employ this algorithm as a subroutine 
several times throughout this paper; for simplicity, we will always set k = logn. This gives 
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US a spanner on 0(n) edges with 0(1) multiplicative stretch. The subroutine of generating 
such a spanner will be called multspan(G). 

Lemma 2. Let H be a subgraph of G over the same vertex set, and suppose H contains 
exactly the edges o/multspan(G) and createStrips(G,C, d, m). Let u, v be nodes in G with 
the property that pq{u,v) intersects at most k strips and at most k clean clusters. Then 
Ph{u, v) < pg{u, v) + d{k). 

Proof. Construct a new path P as follows: 

1. Start at u. 

2. Repeat until you reach v: 

(a) Walk down pg{u,v) until you encounter a node x that belongs to a non-clean 
cluster G. Let S' be a strip that intersects G. Let s € S C G. 

(b) Travel the shortest path from x to s. 

(c) Walk along S until the last cluster G' intersected by both S and pg{u,v). Let 
s' € S n G'. Let x' G pg{u, v) n G'. 

(d) Travel the shortest path from s' to x'. 

(e) Walk along pg{u,v) until you exit G'. 

First, we will argue that P is only 0(fc) longer than pg(u,v). Each time P departs from 
Pg(u,v), we travel distance at most two (from x to s, which belong to the same cluster), 
and when P rejoins pg{u,v) we again travel distance at most two (from s' to x', which 
again belong to the same cluster). Let a G pg{u,v) be a node immediately before one of 
these departures, and let b G pg{u,v) be a node immediately after one of these departures. 
It follows from the triangle equality that |P[a O 6 ]| < ^ 0 ( 0 , b) + 8 . Since we assume there 
are at most k departures in total, this implies that |P| < 5G{a, b) + 8k. 

Second, we will argue that only 0(fc) edges in P are missing from H. To see this, each 
edge (a, b) G P falls into one of the following cases: 

1. The nodes a and b are both unclustered: this edge is present in H. 

2 . At least one of the nodes belongs to a clean cluster: this edge might not be present in 
H. We have assumed that pg{u,v) intersects at most k clean clusters, and therefore 
P intersects at most k clean clusters as well. P intersects any clean cluster on at 
most four edges: any clean cluster it intersects also intersects pg{u,v), and if pg{u,v) 
intersects two nodes of the cluster, then they are at distance at most 2 and hence can 
have at most one node between them on pg{u,v). Therefore, there are at most 4fc of 
this type of edge in total in P. 

3. At least one of the nodes belongs to a non-clean cluster, but {a,b) is not contained 
in any strip: this must be one of the edges immediately preceding or immediately 
following a departure of P from pg{u,v). As discussed above, there are at most four 
of these edges per departure. There are at most k departures, so there are at most Ak 
of this type of edge in total in P. 

4. The edge (a, b) is contained in a strip: this edge is present in H. 
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Therefore, at most 8k edges in P are missing from H. For each missing edge (a, b), we know 
from edges added in the multspan subroutine that pH{a,b) = 0(1). From this, we can 
conclude |P| < pg(m, v) + 0 (fc), so ph{u, v) < pg{u, v) + 0{k). □ 

Lemma 3. Let H be a subgraph of G over the same vertex set, and suppose H contains 
exactly the edges o/ multspan(G) and createStrips(G,C, d, m). Let u,v be nodes in G 
with the property that pg{u,v) intersects exactly k clean clusters and fewer than | strips. 
Then5G{u,v)>Ll{^). 

Proof. Partition pg{u,v) into — sections, where each section contains exactly m clean clus¬ 
ters (the last section might contain fewer). For each of these sections, we evidently chose not 
to add this subpath P of pg{u,v) as a new strip. That means that either (1) P intersects 
at least m clusters, or (2) P has length at least d. The former can only be the case at most 
half the time, otherwise pg{u,v) intersects at least | strips. Therefore, at least half of these 
sections have length at least d, so the total length of pg{u,v) is at least □ 

3 First Spanner Construction 

Here we will prove the following theorem: 

Theorem 1. For any parameter e G [0,1], there exists a spanner on 0(71^+*^) edges with 
additive distortion 0(n^/^“'^/^). 


Main Construction Our first spanner construction is as follows: 


Algorithm 3: 0(n^/^ “^Z^) additive distortion spanners on 0(n^+'^) edges 
Data: An unweighted, undirected graph G = {V, E) and a number e 

1 Initialize H — multspan(G); 

2 Initialize C — cluster(G, n*^); 

3 Initialize iS ^ createStrips(G,C, 00 , // (G,C,d,m) 

4 Add all edges in 5 to id; 

5 Add all edges to FI whose endpoints are both unclustered in C; 

6 return ii; 


Our edge bound for this construction is very straightforward. The multspan subroutine 
adds 0{n) edges, the createStrips subroutine adds 0(n) edges, and since every unclustered 
node has at most unclustered neighbors (otherwise we could have turned these into a 
new cluster), there are at most edges with both endpoints unclustered. 

We will now prove our error bound. 

Claim 2. Any shortest path pg{u,v) intersects at most clean clusters. 

Proof. If pg{u,v) intersected more than clean clusters, then we would add one of 

its subpaths as a new strip before terminating the createStrips subroutine. □ 



Proof. First, note that there are at most strips in S, since each strip intersects 

f^i/ 2 -e /2 previously clean clusters, and there are at most clusters in total. The error 
bound is now immediate from Lemma [21 □ 
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4 Second Spanner Construction 

We will prove the following results: 

Theorem 2. Every graph has an emulator on edges with additive 

distortion, whenever e G [0, |]. 

Theorem 3. Every graph has a spanner on edges with additive dis¬ 

tortion, whenever e G [0, ^]. 

Hitting Sets We will need one last subroutine before we proceed with the construction. 
Let V = {1,..., n} be a set of nodes, and let 7^ be a set of subsets of V. We say that H cV 
is a hitting set of TZ if for every R G TZ, there is a node h G H D R. The following result is 
well known: 

Lemma 4. Let m be the minimum size of any element R G TZ. Then TZ has a (polynomial¬ 
time eonstructible) hitting set H with \H\ = 0(^ log(|7?,|). 

It can be shown that the greedy algorithm, which repeatedly selects the node h that hits 
the most un-hit sets in TZ, will suffice to implement this lemma. It can also be shown that a 
sufficiently large random sample of nodes will implement this lemma with high probability. 
We will not prove this here; instead, we will simply use the notation hittingSet(7?.) as a 
subroutine that constructs a hitting set of TZ of the asymptotic size bound given in the above 
lemma. 

Subset Spanners A fc-additive subset spanner of a graph G = {V, E) and a subset S GIV 
is a subgraph H oiG such that for every s G S,v gV has 5g{s, v) < Snis, v) < 6 g{s, v) -\- k. 
That is, subset spanners are only required to approximate the distances between S and 
the rest of the graph. Cygan, Grandoni and Kavitha [I2j obtain a subset spanner on 
0(n(|S'| logn)^/^) edges and distortion 21ogn. We refer to the procedure constructing such 
a subset spanner for a graph G and subset S as subsetspan(G, S). 

Main Construction This time, we will have two parameters in our construction: A and 
ji. For the emulator construction, we set /r = ^ — |e and A = | — |e. For the spanner 
construction, we set ^ | — |e and A = | — |e. Note that we have the constraint ^ > 0. 

This gives rise to the restrictions e G [0, |] for the emulator and e G [0, |] for the spanner. 


Algorithm 4: Algorithm that implements Theorems and 
Data: A graph G = (V, E) and a number e 

1 Initialize H ^ multspan(G'); 

2 Initialize C ^ cluster(G, n*^); 

3 Initialize iS ^ createStrips(G,C, n^) // {G,C,d,m) 

4 Add all edges in 5 to if; 

5 Add all edges between unclustered nodes to H-, 

0 Initialize Q ^ 0; 

7 for every ordered pair of nodes u,v such that pc{u,v) intersects at least ^ strips 
or at least clean clusters do 

8 

9 

10 

11 
12 

13 

14 

15 

16 end 

17 Initialize T ^ hittingSet(Q); 


Let X be the first node on pq{u,v) such that pa{u,x) intersects at least ^ 
strips or at least clean clusters; 
if pg{u,x) intersects at least ^ strips then 

Initialize Q to be the set of all nodes within distance two of any of these 
strips; 

Add Q to Q; 
else 

Initialize Q to be the set of nodes in pq{u,x)] 

Add Q to Q; 

end 


Algorithm 4: Emulator Continuation (Theorem]^ 

18 for every pair ti,t 2 do 

19 I Add an edge to H between G and t 2 of weight 5 g(G,G) ; 

20 end 

21 return iL; 


Algorithm 5: Spanner Continuation (Theorem]^ 

18 Add to H all edges in subsetspan(G, T); 

19 return H; 

The main for loop in this algorithm omits pairs it, v with the property that pa (it, v) 
intersects less than ^ strips and less than clean clusters. We can ignore these paths 
because we already have enough edges in place to span these paths within distortion 0{n^) 
(this is immediate from Lemma [^. 

We will now prove our certificates for this construction. 

Claim 4 (Edge Bound). In either case, the returned graph H has 0(n^+'^) edges. 

Proof. Our first goal is to place a lower bound on the minimum size of any element Q G Q. 
We create these elements Q in lines and In the first case, we are adding the nodes 
within distance two of r2(n'^) different strips. Each of these strips was the first to intersect 
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distinct clusters, each of which contains at least n'^ nodes. Therefore, the size of each of 
these elements is In line 13 we add all of Sg{u,x) to Q. We know that this 


path intersects at least clean clusters and at most ^ strips. By Lemmawe conclude 
that this path has length at least 

By Lemma 1^ the size of T is at most p^j. emulator 

parameter settings, this gives |T| = so we add |Tp = 0(n^+'^) edges to H in 

the emulator continuation. For the spanner parameter settings, this gives |T| = 0(n^'^). 
Then subsetspan(G,T) has 0(n|T|^/^) = edges. □ 


Claim 5 (Error Bound). The returned graph H spans/emulates G with 0(n^) 
distortion. 


additive 


Proof. Let u,v be arbitrary nodes in V. First, suppose that pg(u, u) intersects fewer than 
^ strips and fewer than clean clusters. Then by Lemma 2 we already have Ph{u, v) < 
Pg{u,v) + d ( n ^). 

Otherwise, let Xu be the first node on pg{u,v) such that pGiu,Xu) does not have this 
property, and let Xy be the same for pg{v,u). First, suppose pg{u,Xu) intersects at least 


strips. Then there is some node € T on one of these strips (Line 


10). Otherwise, 


there is some node t^ G T that sits directly on the path pg(u,Xu) (Line ISiTLet u>„ be the 
closest node on pg{u,Xu) to Define ty,Wy similarly with respect to v. 

We now proceed by the triangle inequality. We have 

SGitu,tv) < 5G{tu,Wu) + 5G{Wu,Wy) + SG^Wy^ty) 

We know that 8G{tmWu) and 5G{wy,ty) are both 0{n^), since ty (ty) is within distance 
two of a strip that intersects pg{u,Xu) {pG{xy,v)). We can then write 

5GiWu,tu) + 5G{tu,ty) + SG{ty,Wy) < 5G{Wu,Wy) + 0{n^) 

Some algebra yields 

(5g(u, Wu)+SGiwu, tu)+SGitu, ty)+5G{ty, Wy)+6Giwy,v) < SGiu, Wu)+Sg{wu, w„)+(5g(w«, n)+0(n^) 

Since Wu,Wy G pg{u,v), the right-hand side simplifies. 

5g{u,Wu) + SG{'Wu,tu) + SG{tu,ty) + SGity,Wy) -I- SGiwy,v) < (5g(m,w) + 0{n^) 

It follows from Lemmaj^that 6h{u,Wu) < 5g{u,Wu) +0{n^) and SH{wy,v) < 6G{wy,v) -I- 
0{n^). We also know that (5 g('!c«, tu) < and 6G{tv,Wy) < n^. Additionally, we know 
that Snitu, ty) < 5G{tu, ty)+2, because we have added a subset spanner (or a direct emulator 
edge) that enforces this property on every pair of nodes in T. This gives us 

6h{u,Wu) -b Sniwy^ty) + Snity^ty) + 5H{ty,Wy) + SH{wy,v) < SGiu,v) -b O(n^) 

We finish by applying the triangle inequality to the left-hand side. 

Sh{u, v) < 5g(w, v) + d{n^) 


□ 


It is worth noting that better spanners would follow quickly from better subset spanners. 
It is not yet known how to generalize the subset spanner result in |12j to fewer edges and 
a non-constant error bound; for our purposes, any error of up to would be acceptable. 
This seems to be an attainable and relevant problem, which we leave open. 
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5 Conclusion 


We have improved the additive distortion bounds for spanners and emulators for many 
values of e below the edge threshold. The main open question of spanner research still 
remains: do there exist spanners or emulators on edges with additive distortion 

for some constant <5 > 0? 

Our work also suggests that further improvements in spanner construction may follow 
from generalizations of the recent work in subset spanners. More formally: can we construct 
a subset spanner on an asymptotically smaller number of edges, given a subset size of [S'! 
and a budget of n‘^ additive distortion? 
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